Learning Rules for Chinese Prosodic Phrase Prediction

Authors

  • Sheng Zhao
  • Jianhua Tao
  • Lianhong Cai
Abstract

This paper describes a rule-learning approach to Chinese prosodic phrase prediction for TTS systems. First, we prepared a speech corpus of about 3000 sentences and manually labelled the sentences with a two-level prosodic structure. Second, candidate features related to prosodic phrasing, together with the corresponding prosodic boundary labels, are extracted from the corpus text to build an example database. A series of comparative experiments is conducted to identify the most effective features among the candidates. Finally, two typical rule-learning algorithms (C4.5 and TBL) are applied to the example database to induce prediction rules. The paper also suggests general evaluation parameters for prosodic phrase prediction. Using these parameters, our methods are compared with RNN-based and bigram-based statistical methods on the same corpus. The experiments show that the automatic rule-learning approach achieves better prediction accuracy than the non-rule-based methods while retaining the simplicity and understandability of rule systems. It is therefore justified as an effective alternative for prosodic phrase prediction.
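The example-database step in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the toy sentences, the POS tags, the label names B0/B1/B2, and the four features. The gain ratio computed at the end is the split criterion used by C4.5; it is shown only as one way to rank candidate features, not as the comparative procedure the authors used.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus: each word is (token, POS tag, boundary label after the token).
# The label names B0/B1/B2 (no break / minor break / major break) are illustrative only.
SENTENCES = [
    [("我们", "r", "B0"), ("的", "u", "B1"), ("系统", "n", "B1"), ("很", "d", "B0"), ("好", "a", "B2")],
    [("他", "r", "B1"), ("喜欢", "v", "B0"), ("音乐", "n", "B2")],
    [("今天", "t", "B1"), ("天气", "n", "B1"), ("不错", "a", "B2")],
]


def build_examples(sentences):
    """Turn every word juncture into a (features, label) example."""
    examples = []
    for sent in sentences:
        for i in range(len(sent) - 1):  # the sentence-final juncture is skipped
            (w1, p1, label), (w2, p2, _) = sent[i], sent[i + 1]
            features = {
                "pos_left": p1,
                "pos_right": p2,
                "len_left": len(w1),
                "len_right": len(w2),
            }
            examples.append((features, label))
    return examples


def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def gain_ratio(examples, feature):
    """C4.5-style gain ratio: information gain divided by split information."""
    labels = [label for _, label in examples]
    groups = defaultdict(list)
    for feats, label in examples:
        groups[feats[feature]].append(label)
    n = len(examples)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    split_info = entropy([feats[feature] for feats, _ in examples])
    if split_info <= 0:  # a constant feature carries no information
        return 0.0
    return (entropy(labels) - conditional) / split_info


EXAMPLES = build_examples(SENTENCES)
for name in ("pos_left", "pos_right", "len_left", "len_right"):
    print(name, round(gain_ratio(EXAMPLES, name), 3))
```

On a real corpus the same example table could be passed to a decision-tree or transformation-based learner; the toy data here is far too small for the printed scores to mean anything.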

Similar resources

Prosodic phrasing with inductive learning

Prosodic phrasing is an important component in modern TTS systems, which inserts natural and reasonable breaks into long utterances. This paper reports a study applying several inductive machine-learning algorithms to prosodic phrasing in unrestricted Chinese texts. Two feature sets are carefully selected considering their effectiveness and reliability in practice. Then features and t...

Prosody prediction for speech synthesis using transformational rule-based learning

Prediction of symbolic prosodic labels (pitch accents and phrase structure) is an important step in generating natural synthetic speech. This paper investigates a new automatically trainable procedure for combined accent and phrase prediction based on transformational rule-based learning. Experimental results on a radio news corpus show that accent prediction benefits from phrase structure, but ...

Prosodic Phrase Detection for Chinese TTS Using CART and Statistical Model

Determining prosodic phrase breaks from text is one of the important problems in generating good prosody for Chinese text-to-speech systems. In this paper, we propose a statistical approach for detecting prosodic phrase breaks. Part-of-speech sequence information is used as the primary information. The history of the previous breaks is considered as a constraint in this work. The probabilities...

Prosodic Fillers and Discourse Markers–Discourse Prosody and Text Prediction

Mandarin Chinese fluent speech prosody is characterized by a hierarchical multiple-phrase structure that specifies how speech paragraphs are constituted via Prosodic Phrase Grouping. Hence we view spoken discourse prosody as yet another higher node that treats PGs (Prosodic Phrase Groups) as sister constituents. The goals of the present study are twofold: one is to study how speech paragraphs are conne...

Combining models of prosodic phrasing and pausing

This paper describes two approaches to assigning prosodic phrase structure and pauses to text and investigates the impact of errors in the assignments for different granularities of prosodic phrase structure. One approach uses a cascaded combination of models trained separately for prediction of prosodic phrase structure and pauses and the other uses a model trained for the joint prediction tas...

Publication date: 2002